Attorney's Docket No.: 13425-047001 / 00357-US 



APPLICATION 



FOR 



UNITED STATES LETTERS PATENT 



TITLE: NEW NUCLEAR MAGNETIC RESONANCE SCREENING 

METHOD 

APPLICANT: JOHAN WEIGELT AND MATS WIKSTR6M 



CERTIFICATE OF MAILING BY EXPRESS MAIL 

Express Mail Label No. EL 856746271 US 

I hereby certify under 37 CFR §1.10 that this correspondence is being 
deposited with the United States Postal Service as Express Mail Post 
Office to Addressee with sufficient postage on the date indicated below 
and is addressed to the Commissioner for Patents, Washington, 
D.C 20231. 




Attorney's Docket No. 13425-047001 / 00357-US 
NEW NUCLEAR MAGNETIC RESONANCE SCREENING METHOD 

Cross Reference to Related Applications 
This application claims priority from Swedish Patent Application No. 00038 1 1-7, 
filed October 20 5 2000, and U.S. Provisional Patent Application Serial No. 60/243,626, 
filed October 26, 2000. These applications are incorporated herein by reference in their 
entirety. 

Technical Field 

The present invention relates to a nuclear magnetic resonance (NMR) based 
method for assaying binding of various chemical compounds to a polypeptide or a pro- 
tein. More specifically, the method allows for the screening of binding to a designated 
binding epitope on the surface of the polypeptide or protein. 

Technical Background 

In modern biology and medicine, there has been a demand for physical methods, 
which can make it possible to study the structure of small and large biomolecules, as well 
as the interaction between various molecules and compounds. For this purpose, several 
powerful techniques have been applied, such as x-ray crystallography, mass spectrometry, 
and nuclear magnetic resonance (NMR). 

In recent years, NMR spectroscopy has become an important tool in the drug dis- 
covery process through the advent of NMR based screening methods to identify lead tem- 
plates. Several techniques have been introduced, [Shuker, 1996; Chen, 1998; Mayer, 
1999; Chen, 2000; Jahnke, 2000] perhaps the most well known is the n SAR by NMR" 
method described by Fesik and coworkers in 1996. [W097/18471, Shuker, 1996 and EP- 
B 1-0866967] The SAR by NMR technique relies on detecting chemical shift changes in a 
two-dimensional l H.- ls N correlation spectrum to identify compounds that bind to the tar- 
get protein. When a first ligand has been identified, a second ligand is sought for in the 
presence of saturating concentrations of the first ligand (screen for "second-site" binder). 
Provided the three dimensional structure of the protein is known and sequence specific 
NMR assignments of the protein backbone resonances have been obtained, the two 
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ligands may be linked based on structural data. To first approximation the binding affinity 
of the linked compound will be the product of the binding affinities of the individual 
compounds, resulting in high affinity "nanomolar binders" even if the starting compounds 
only had millimolar to micromolar affinities. [Shuker, 1996] 

A prerequisite for the S AR by NMR method is that sequence specific resonance 
assignments have been obtained for the backbone NMR resonances ( 15 N, 13 C a , I3 C, *H N ) 
of the target protein. This is a formidable task that demands several months of experimen- 
tal work and data analysis even for a relatively small protein. 

For structural studies of large proteins, Yabuki et aL, 1998, discloses a method, 
which method involves the site-specific labeling of two in the peptide sequence following 
amino acids with 13 C and 15 N, respectively. This is done in a so-called cell-free system, 
in order to avoid problems of disturbing incorporations. Hereby, due to 1 JcN-coupling it is 
possible to study only one signal in an adequate spectrum. This article is mainly focused 
on the development of methods for cell-free synthesis for structural studies of the human 
c-Ha-Ras protein, as well as protein-protein interactions. 

Accordingly, the known techniques for screening for small binder molecules to a 
specific site in a target protein, such as an active site, pose some disadvantages, in that 
they are time-consuming and involve a lot of experimental work as well as complex in- 
terpretation of data achieved. Thus, there is a need for a method allowing identification of 
binder molecules to a specific target in an easier and more effective way, thereby limiting 
the experimental work. 

The object of the invention is to reduce the drawbacks and expand the possibilities 
of the prior art. 

Summary of the Invention 
This object is, according to the invention, solved by a method for identifying at 
least one binder molecule comprising the steps of: 

(a) choosing two amino acid types (AA1 and AA2) in a polypeptide or protein of in- 
terest, whereby AA2 at least once occurs directly subsequent to AA1 in the amino 
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acid sequence of the polypeptide or protein, defining an amino acid pair AA1- 
AA2; 

(b) labeling the two amino acid types (AA1 and AA2) in the polypeptide or protein of 
interest, whereby all AA1 -residues is labeled with 13 C and all AA2-residues with 
15 N; 

(c) generating a first HNCO-type NMR spectrum of the labeled polypeptide or protein 
from step (b), thereby identifying signals from the labeled amino acid pair AA1- 
AA2; 

(d) contacting the labeled polypeptide or protein with a potential binder molecule or a 
mixture of binder molecules under conditions and sufficient time for allowing 
binding of the potential binder molecule(s) and the labeled polypeptide or protein; 

(e) generating a second HNCO-type NMR spectrum, or a ^-^N correlation type 
NMR spectrum, of the mix from step (d), monitoring signals identified in step (c); 

(f) comparing the first and the second NMR spectra, whereby a chemical shift change 
of the signals identified in step (c) between the two spectra indicates an interaction 
between the potential binder molecule and the labeled polypeptide or protein. 

Hereby, by using efforts from the structural studies of large proteins in the field of 
drug molecule screening, the inventors have unexpectedly provided a method, which al- 
lows an easy identification of binder molecules to a target molecule. 

Yet another aspect of the invention is a method, whereby the labeled amino acid 
pair AA1-AA2 is unique within a sphere radius of 10 A, preferably 50 A, and most pref- 
erably within the whole polypeptide or protein. 

One embodiment of the invention is a method, whereby the labeled amino acid 
pair AA1-AA2 is within a binding pocket of the polypeptide or protein. This allows for 
the screening for binding to a single binding site. The requirement is that a unique amino 
acid pair occurs in the binding pocket. 

Another embodiment of the invention is a method, whereby the labeled amino 
acid pair AA1-AA2 is in the proximity, preferably closer than 15 A, of an active site 
within the polypeptide or protein. This enables the screening for specificity in vicinity of 
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an active site in the polypeptide or protein. If the target protein has one or more unique 
amino acid pairs in the vicinity of an active site, whereby the unique amino acid pairs dif- 
fers from other members of the same protein family, this can be used to screen for selec- 
tivity, i.e. the different proteins of the same family can be compared. Different com- 
pounds may give different chemical shift, why the longest distance from the active site, 
which makes specificity screening possible, varies. 

Yet another embodiment of the invention is a method, whereby the result of the 
method is compared to the result of any other suitable binding or activity assay, such as a 
fluorescence-based assay, a reporter gene assay, displacement assays or ELISA. This al- 
lows for a rapid confirmation of the binding to this specifically labeled site, or as a selec- 
tion of candidates for more extensive study by any other suitable method. 

Accordingly, according to another aspect of the invention, the method is used for 
screening of a compound library. 

All publications, patent applications, patents, and other references mentioned 
herein are incorporated by reference in their entirety. 

Short Description of the Drawings 

Figure 1 shows the site specific labeling of the PTPlB-protein. All arginine and 
aspartate residues have been enriched with 13 C and 15 N, respectively. There is only one 
unique Arg-Asp pair in the sequence (boxed). 

Figure 2 shows 2D NMR spectra of 15 N-Asp, 13 C-Arg labeled PTP1B: (A) 2D 
TROSY of free protein; (B) 2D HNCO of free protein; and (C-F) 2D TROSY spectra of 
(C) free protein; (D) protein + compound cocktail; (E) protein + compound cocktail omit- 
ting PNU179983; and (F) protein + PNU179983. 

Figure 3 is a ribbon drawing of PTP1B showing the five selectively labelled 
amino acid residue pairs. The "Y-loop pairs" Arg47-Asp48 (I) and Arg43-Asn44 (II) are 
shown, as well as Argl56-Glnl57 (III), Arg 169-Glul70 (IV) and Argl99-Glu200 (V), 
and the active site cysteine (VI). 

Figure 4 is a table showing the result of a colorimetric assay. All compounds 
were tested in a colorimetric assay. Samples containing 1 pM PTP1B, 2 mM para- 



Attorney's Docket No. 13425-047001 /00357-US 



nitrophenylphosphate (substrate) and 1 mM compound were prepared in 20 mM Tris-HCl 
buffer at pH 7.5 and incubated at room temperature for 5 minutes. The reaction was 
stopped by addition of NaOH to raise the pH. The amount of inhibition was calculated 
from the measured absorbance of the resulting product (para-nitrophenole). 
5 Figure 5 is a stereo-view of PNU 179983 binding to PTP1B as determined from a 

preliminary x-ray crystallographic electron density map obtained at 2.2 A resolution. 
PNU 179983 clearly interact not only with the active residues but also have close contacts 
with the Y-loop (highlighted) via hydrogen bonds. The closest distance between 
PNU1 79983 ant the Y-loop is approximately 2.8 A. The crystals were grown in the pres- 
10 ence of 2 mM inhibitor in 0.1 M pH 6.5 cacodylate buffer, 0.2 M Mg acetate, 16-20 % 
p PEG 8000. The crystals belonged to space group P222 with cell dimensions a=53.2 A, 

*fi b=84.3 A, c=88.7 A, alpha=beta=gamma=90. Data extended to 2.2 A with Rsym=6.7%. 

5 Data was processed and the model refined to R=25%. 

fy Figure 6 is a schematic view of one embodiment of the invention (screening for 

% 1 5 binding to a single binding site). An exemplary target sequence is shown (SEQ ID 
NO:3). 

q Figure 7 is a schematic view of another embodiment of the invention (screening 

% for specificity in vicinity of an active site). An exemplary target sequence, and exemplary 

O protein sequences x, y, and z are also shown (SEQ ID NOs:4-7, respectively). 

20 Figure 8 shows a NMR-spectra recorded on a 200 uM 13C'-Val, 1 5N-Ala selec- 

tively labeled sample of M-FABP. The spectra were recorded at 20 °C on a 600 MHz 
Varian Unity INOVA NMR spectrometer using standard techniques. The sample was 
prepared in 20 mM sodium phosphate buffer (10 % D 2 O/90 % H 2 0) pH 7.4 containing 50 
mM NaCl, 10 mM DTT and 0.02 % NaN 3 . NmrPipe (Delaglio et al., 1995) was used for 
25 data processing, (a) 2D l H- 15 N fast-HSQC (Mori et al., 1 995), (b) 2D HNCO and (c) ID 
HNCO. The HNCO spectra in panels b and c identify the resonance corresponding to 
alanine 33 (Roman numeral II in panel a). Experimental times were 1 hour, 4 hours, and 
20 minutes respectively. 

Figure 9 shows plots of 'H- 15 N fHSQC spectra for M-FABP recorded at 600 MHz 
30 showing the region corresponding to the alanine 33 cross peak of Figure 8a. Experimental 
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conditions were identical to those described in Figure 8 except the samples used for spec- 
tra b-d contained additionally 1 % d-DMSO. (a-d) Reference spectrum of free M-FABP. 
(b) 1:1 mixture of M-FABP and compound cocktail, (c) 1:1 mixture M-FABP and com- 
pound cocktail omitting oleic acid, (d) 1:1 mixture of M-FABP and oleic acid. 

Detailed Description of the Invention 

By a "compound library" is meant a set of chemical compounds, that for example 
can be used for assaying binding to a macromolecule. The compound library may com- 
prise from about 2 - 100.000 compounds, preferably 100-1000. The compounds of the 
library can be of any size, preferably 50-1000 D. The compounds can be any organic 
molecules or natural products. 

By a "polypeptide or protein of interest" is meant any amino acid sequence, or a 
complex of amino acid sequences, having a potential binding epitope. 

By a "potential binder molecule" is meant a chemical compound, which may bind 
to the polypeptide or protein of interest, and which may be a member of the compound 
library. The potential binder molecule may be a peptide, a polypeptide, a protein, an anti- 
body, a nucleic acid molecule, a carbohydrate, or a part or a complex of one or more of 
the mentioned molecule types, or any other chemical compound of interest. Preferably, 
the potential binder molecule is a relatively small molecule, such as a molecule in the size 
interval of 50-1 000 D. 

"AA1" and "AA2" can be any one of the about 20 different naturally occurring 
amino acid types. Furthermore, AA1 and/or AA2 may also be any modified variant of an 
amino acid, as long as it is possible to detect the 1 J C N-coupling. AA1 and AA2 may be 
the same or different type. 

By AA2 at least once occurs "directly subsequent" to AA1 in the amino acid se- 
quence is meant that AA1 in this case has the sequence position n-1 and AA2 sequence 
position n. 
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A "AAl-AA2-pair" is a pair of amino acids following each other directly in se- 
quence, whereby AA1 has the number n-1 and AA2 the number n in the amino acid se- 
quence. 

By a "binding pocket" is meant an epitope on the polypeptide or protein of inter- 
5 est, that is expected to allow binding of a potential binder molecule. 

By a "sphere radius" in this context is meant a defined distance in any direction in 
space from a defined pair of amino acids (AA1-AA2). 

The core of the invention is a revitalization of a sequence specific labeling method 
that has not been extensively used. [Kainosho, 1982, Kato, 1991, Yabuki, 1991] All 
10 amino acids AA1 are labeled with 13 C and all amino acids AA2 are labeled with 15 N. 

Provided only one AA1-AA2 pair occurs in the amino acid sequence, only one signal in 

€! the ID carbonyl 13 C spectrum will display a splitting due to the Ocn coupling. Obviously 

Ci 

JO only one peak will appear in a ID (or 2D or 3D) HNCO type correlation spectrum (Figure 

hi 

m l) - 

Hp 1 5 The labeling strategy is only sequence specific in an indirect sense, I e. the occur- 

r rence of a unique pair of labeled amino acid residues confers the sequence specificity. Us- 

ing this technique it is possible to screen selectively for binding to a selected epitope 
^ without the need for sequence specific assignments. The HNCO spectrum can thus be 

Q used either directly as a screening experiment (ID or 2D versions) or indirectly to identify 

p:i 20 what signals to monitor in a 2D *H- 15 N correlation spectrum. Chemical shift perturbations 
upon addition of a potential ligand are easily detected even for large proteins due to the 
reduced spectral complexity resulting from the use of a selectively labeled sample. 

The site specificity is inherent to the labeling technique and the key is to find a 
unique sequence motif in the binding site that is to be screened. Given a protein of 300 
25 amino acid residues, and assuming equal and random distribution of all 20 amino acid 
residues, the probability that a given amino acid residue pair only occurs once in the se- 
quence is approximately 0.5. If the desired site contains at least 3-4 suitable pairs, the 
probability of finding a unique pair is reasonably high, i.e. 85-95 %. 

The most reliable way to obtain a sequence specific labeled protein is to over- 
30 express the protein using cell-free synthesis as described by Yabuki et al. [Yabuki, 1998]. 
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Alternatively the protein could be over-expressed in rich medium containing the labeled 
amino acids, using a bacterial strain with gene lesions suitable for the particular type of 
selective amino acid enrichment [Muchmore, 1989]. In favorable cases however, good 
results have been reported using a prototrophic bacterial strain in a rich medium contain- 
5 ing all amino acids and nucleotides. [Kainosho, 1982] [Muchmore, 1989] 

According to one embodiment of the invention, the method can be used for 
screening for binding to a single binding site (figure 6). This can be done if the target pro- 
tein or polypeptide of interest has at least one unique amino acid pair (two amino acids 
being adjacent to each other in the primary structure) within the potential binding site. 
10 Then, this unique amino acid pair is labeled according to the method of the invention, and 

Q the interaction between the binding site and potential binder molecules can be studied. 

i.f| 

/■S According to another embodiment of the invention, the method can be used for 

"■t&i 

^ screening for specificity in vicinity of an active site (Figure 7). This can be achieved if the 

f|j target protein or polypeptide of interest has at least one unique amino acid pair in vicinity 

jS 15 of the active site (preferably closer than 1 5 A, and more preferably within 5-1 5 A). Fur- 

? ther, if the target protein differs from other proteins within the same protein family in that 

Q the target protein has at least one unique amino acid compared to the other proteins, this 

lp can be used to screen for selectivity of potential binder molecules to the target protein. 

|S This means that the unique amino acid, which differs from the other proteins of the fern- 

is?*:* 

20 ily, may be close to the active site with regard to tertiary structure, but not necessarily 

close with regard to primary structure. However, the unique amino acid should preferably 
not be within the active site. 

According to yet another embodiment of the invention, the method of the inven- 
tion can be used in combination with any other suitable binding or activity assay, such as 

25 a fluorescence based assay, a reporter gene assay, displacement assays or ELISA, in order 
to either confirm the result of that method, or to screen for interesting compounds, which 
subsequently are more thoroughly studied by another method. The method of the inven- 
tion can be advantageously used in this aspect as it provides a method, which rapidly 
gives a reliable result. This can be useful when there are no X-ray data for the studied 

30 protein, but only computer modelling data. 
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The conditions suitable for allowing the potential binder molecule and the labeled 
protein or polypeptide of interest to interact as well as to monitor the interaction by 
NMR, are standard conditions for protein NMR [Cavanagh] (buffered solutions, pH kept 
stable, reaction temperature 5-50°C). Preferably, the target protein concentration may 
5 range from 25 jaM - 1 mM, and the potential binder molecule concentration from 25 |iM 
-ImM, 

In order for a change in the chemical shift to be considered relevant, the change 
should be equal to or greater than the natural line width, or the signals should be ex- 
change-broadened beyond detection. 
10 Moreover, the method of the invention may be used for competition binding 

q analysis experiments. 

**j To summarize, the new technique presents a new approach to screen and identify 

ffl binders to protein targets in a site-specific manner. This will allow the identification of 

? j i scaffolds (by screening a small compound library) with a binding preference for a particu- 

+; 1 5 lar site or sites, which could confer specificity for a certain target within a protein family, 
s and also confirm the binding mode from hits detected using other suitable binding or ac- 

f j tivity assays, such as a fluorescence based assay, a reporter gene assay, displacement as- 

says or ELISA. 

O Also, the new technique has the potential to allow NMR studies of proteins of 

20 much larger size than traditional S AR-B Y-NMR since essentially no assignment of the 
protein resonances has to be undertaken. Conventionally, only proteins of the size up to 
approximately 30 kD are possible to study. With the method of the invention, proteins of 
the size 50 kD, and most probably of the size up to 100 kD, can be monitored. The sam- 
ple preparation can be prepared according to previously described protocols. [Muchmore 
25 1989, Yabuki 1998] 

For further details on NMR spectroscopy or on experimental approaches relevant 
in the field of the invention, W097/18471 and Cavanagh, 1996 are hereby incorporated 
as references. 
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Below, the invention is described by the following examples, which only are to be 
seen as exemplifying the invention, and not limiting the scope of the invention in any 
way. 

Examples 

Example 1: Screening for binding to the Y-loop of Protein Tyrosine Phosphatase- 
1B(PTP1B) (SEQIDNO:l). 

PTP1B (35 kD, single domain protein] is a protein that dephosphorylates phos- 
photyrosines. The active site binding cleft is centered around an active cysteine residue. It 
has been speculated that the so-called Y-loop of PTP1B is important for binding of some 
ligands. The Y-loop contains a sequence motif that is unique in the sequence, i.e. Arg47- 
Asp48 (figure 1). A selective labeled sample of PTP1B was prepared to monitor binding 
to this site. 

Site specific labeled PTP1B (residues 1-298) was prepared from transformed Es- 
cherichia coli strain BL21(DE3) cells. Bacteria were grown in rich medium containing all 
20 amino acids and 5 nucleotides according to the protocol described by Muchmore et al. 
[Muchmore, 1989] Aspartate and arginine were supplied 15 N-enriched and 13 C-enriched, 
respectively. It should be noted that this protocol (i.e. the use of a prototrophic bacterial 
strain) for preparing the sample is not optimal for the production of selectively 15 N-Asp 
enriched protein. Aminotransferase activity may cause misincorporation of 15 N in other 
amino acid residues. For the case of Asp, incorporation in Asn, Glu and Gin residues may 
be seen. [Muchmore, 1989] Nevertheless, the risk of misincorporation of the 15 N label is 
in no way a limitation to the invention. A selectively labeled protein may be produced 
with no misincorporation of the 15 N (or 13 C) label if a bacterial strain with gene lesions 
suitable to the particular type of selective amino acid enrichment is used. [Muchmore, 
1989] Alternatively, the sample could be prepared using cell-free synthesis [Yabuki, 
1998]. It should also be noted that in favorable cases good results have been reported also 
when prototrophic bacterial strains were used. [Kainosho, 1982] [Muchmore, 1989] 

A two-dimensional ! H- 13 C correlation spectrum of the selectively labeled sample 
verified that only Arg residues were 13 C-enriched. The corresponding l K- l5 N experiment 
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indicated, however, that 15 N was not exclusively incorporated in Asp residues. Figure 2A 
shows the 2D J H- 15 N TROSY [Weigelt, 1998] spectrum of the 15 N-Asp, 13 C-Arg sample. 
If only Asp residues were 15 N-labeled a total of 1 8 peaks would be expected in this 1 H- 
15 N correlation spectrum (18 Asp residues in the amino acid sequence). Instead the spec- 
trum contains approximately 65 peaks, which is consistent with incorporation of I5 N also 
at Asn, Glu and Gin sites (totally 69 in the amino acid sequence). Studying the amino acid 
sequence in detail one can see that there now instead of one unique pair, Arg47-Arg48, 
are four additional ones, Arg43-Asn44, Argl56-Glnl57, Argl69-Glul70, and Argl99- 
Glu200. The first of these, Arg43-Asn44, is located in the beginning of the Y-loop. The 
other three are situated on the protein surface far from the binding site (figure 3). 

Figure 2B shows the 2D HNCO [Cavanagh, 1996] spectrum that selects signals 
from the selectively labeled pairs. As expected, five peaks corresponding to the five 
unique sequence pairs appear in the spectrum (circled). An additional peak (marked with 
an arrow) originating from the C-terminal Asp residue is also seen. The natural abun- 
dance 13 C (1 %) of the preceding Glu residue is enough to yield an HNCO signal due to 
the extremely strong H N -resonance of the C-terminal Asp residue. 

To detect binding to the Y-loop one would expect that one or two of the desig- 
nated signals would experience large chemical shift changes upon addition of a com- 
pound that interacts with the Y-loop, whereas the other three signals would remain essen- 
tially unaffected. 

To test this hypothesis the inventors recorded 2D TROSY spectra in the presence 
of a cocktail of five compounds. One of the compounds, N200, was known to bind to the 
protein as determined by a NMR-line broadening assay (more than 5 Hz line broadening 
at 50 \xM concentration in the presence of an equimolar amount of PTP1B). Three com- 
pounds, N35, N136 and N212, were non-binders as determined by NMR. The fifth com- 
pound, PNU179983, was a micromolar inhibitor of PTP1B 

believed to interact with the Y-loop. Figure 2C shows a selected area from the TROSY 
spectrum of free 15 N-Asp, 13 C-Arg PTP1B. Figure 2D shows the spectrum in presence of 
the cocktail. Figure 2E shows the spectrum in presence of all compounds from the cock- 
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tail except PNU179983. Figure 2F shows the spectrum with only PNU179983 present. In 
all spectra the signals corresponding to the HNCO selected signals are circled. 

It is evident that substantial spectral changes occur in the presence of the com- 
pound cocktail (Figure 2D). In particular two of the designated peaks (marked with aster- 
isks) disappear altogether, whereas the other three signals only experience minor shifts. 
This indicates that at least one of the compounds in the cocktail interacts strongly with the 
Y-loop. When PNU179983 was omitted from the cocktail (Figure 2E) no shifts are ob- 
served for any of the designated signals. Figure 2F shows that PNU179983 is responsible 
for the large shifts since this spectrum shows the same pattern as the one recorded in 
presence of the full cocktail (Figure 2D). 

To further substantiate the finding that the invention allows identification of bind- 
ers to a selected site all compounds were tested in a colorimetric assay (Figure 4). N 200, 
which binds to PTP1B according to the NMR line-broadening assay, showed partial inhi- 
bition of the enzyme, indicating binding to the active site. N35 and N136 also showed 
partial inhibition, which would indicate that they were false negatives or interacting too 
weakly to indicate binding in the NMR line-broadening assay. PNU1 79983 showed com- 
plete inhibition of the enzyme. 

Furthermore, an x-ray crystallographic study was performed to assess the binding 
mode of PNU179983. Preliminary x-ray crystallographic data at 2.2 A resolution (Derek 
Ogg, unpublished results) showed that PNU179983 binds to PTP1B with interactions not 
only within the active site but also with the Y-loop (figure 5). The inhibitor is in close 
proximity to the Y-loop and interacts with the loop via hydrogen bonds. The shortest dis- 
tance between the inhibitor and the Y-loop is approximately 2.8 A. 

Example 2: Site selective screening of human muscle fatty acid binding protein 
(M-FABP)(SEQ ID NO:2). 

The principle of the site-selective screening method is demonstrated for the hu- 
man muscle fatty acid binding protein M-FABP. A 1 .4 A X-ray crystal structure of M- 
FABP in complex with a ligand, oleic acid, is available (PDB ID code: 1 HMS). [Young, 
1994] The structure consists of 10 anti-parallel p -strands and two a-helices that connect 
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|3-strands 1 and 2. The p-strands form two nearly orthogonal p-sheets. The fatty acid 
binding site is situated in a cavity between the two p-sheets. The cavity is also lined by 
residues from helices 1 and 2. The carboxylate group of the fatty acid forms hydrogen 
bonds (direct and solvent mediated) to the protein. The aliphatic tail of the fatty acid 
5 adopts a u-shaped conformation in the highly hydrophobic cavity. Alanine 33 makes con- 
tacts with carbons C12, C13 and C14 of the oleic acid. [Young, 1994] Valine 32 and 
alanine 33 comprise a unique amino acid residue pair in the amino acid sequence, and are 
thus suitable for selective labeling. It is likely that the NMR resonances of alanine 33 will 
be affected upon binding of oleic acid. A 13C-Val, 15N-Ala selectively labeled sample of 
10 M-FABP was produced from prototrophic Escherichia coli cells harboring a plasmid con- 
taining the gene coding for a 143 amino acid protein construct of M-FABP. The construct 

€1 encompassed a C-terminal his-tag included for purification purposes. Cells were grown in 

-CO 

m a fermentor using a growth medium containing all amino acids and nucleotides according 

1 J;! to the protocol of Muchmore et al. Valine was supplied with the carboxyl carbon 1 SC- 

SI 1 5 labeled, alanine was supplied 1 5N labeled (Cambridge Isotope Laboratories). Muchmore 
y ; et al advocates the use of auxotrophic bacterial strains with gene deletions suitable for 

f; the particular type of amino acid labeling that is desired. Even better performance is ex- 

€§ pected from cell-free synthesis [Yabuki, 1998]. Good results have, however, been re- 

£i ported also when prototrophic bacterial strains were used. [Kainosho, 1982; Kato, 1991 ; 

20 Muchmore, 1989; Senn, 1987] The protein was purified by affinity chromatography using 
a Ni2+ charged 5 ml HiTrap Chelating column (Amersham Pharmacia Biotech), followed 
by a Lipidex-1000 column (Sigma) as described by Constantine et al. 

The protein mass was determined using Electrospray Mass Spectrometry. The ex- 
tent of incorporation of 13 C and 15 N in valine and alanine residues was estimated to be at 
25 least 95 %. 

Figure 8 shows NMR spectra of selectively labeled M-FABP. The signals visible 
in a iH-^N correlation experiment (Figure 8a) correspond perfectly to the alanine reso- 
nances of M-FABP. [Constantine, 1998] The 2D HNCO experiment yields cross peaks 
only for ^C-^N- 1 !! moieties. There is only one such instance in the selectively labeled 
30 M-FABP protein, namely ( 13 C)Val32-( 15 N)Ala33. The HNCO spectrum in Figure 8b 
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contains only one cross peak which readily identifies the resonance corresponding to 
alanine 33, in agreement with the published resonance assignment. The spectrum dis- 
played in Figure 8c demonstrates the feasibility of the use of a ID HNCO experiment for 
detection. 

Five test compounds - oleic acid and four compounds from our NMR screening 
library showing no NMR line broadening when mixed with M-FABP - were mixed into a 
compound cocktail. Figure 9 displays spectral expansions of spectra recorded on selec- 
tively labeled M-FABP in different mixtures. 

As expected the HSQC cross peak belonging to alanine 33 experiences a chemical 
shift change upon addition of the test cocktail (Figure 9b). The same chemical shift 
change is observed when M-FABP is mixed with oleic acid alone (Figure 9d). Omitting 
oleic acid from the test cocktail leaves the spectrum unchanged (Figure 9c). Clearly the 
site-selective labeling method promises to be a valuable tool for identifying compounds 
with specific binding properties. Potential applications include: screening for binders to a 
selected site that has been identified either through x-ray, NMR or modeling studies; an 
assay to confirm binding of ligands identified by other methods (i. e. HTS) to a desired 
site; screening for ligands that bind to a site that could confer binding specificity for one 
target protein within a protein family. Due to the reduced spectral complexity resulting 
from the use of a selectively labeled sample, the method should be applicable to larger 
proteins than are conventional methods. 
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